Multi-Granularity Retrieval Model for Bridging Gaps between Biomedical Concepts and Entities: THUIR at TREC 2007 Genomics Track
نویسندگان
چکیده
Abstract General concepts are always used to describe query requirement (In the example “What tumor types are associated with Rb1 mutations?”, “Tumor types” is a general concept, and its entity in a relevant documents can be “brain tumor”). To bridge the gaps between concepts in user queries and entities in relevant documents, we proposed a multi-granularity retrieval model in TREC 2007 Genomics task. The model consists of three components: (1) Paragraph retrieval is employed to retrieve candidate paragraph initially; (2) Dictionary-based NER is utilized to recognize named entities of given types; (3) Passage ranking is used to rank retrieved candidate passages. Our proposed model achieve promising result (Passage MAP=0.1023, with NER bottleneck eliminated).
منابع مشابه
IIT TREC 2007 Genomics Track: Using Concept-Based Semantics in Context for Genomics Literature Passage Retrieval
For the TREC-2007 Genomics Track [1], we explore unsupervised techniques for extracting semantic information about biomedical concepts with a retrieval model for using these semantics in context to improve passage retrieval precision. Dependency grammar analysis is evaluated for boosting the rank of passages where complementary subject/object concept pairs can be identified between queries and ...
متن کاملLearning Domain-Specific Knowledge from Context--THUIR at TREC 2005 Genomics Track
We(Tsinghua University) participated both Ad Hoc Retrieval Task and Categorization Task in TREC2005 Genomics Track, in which we designed and implemented a serious of methods encompassed learning domain-specific knowledge from context. In Ad Hoc Retrieval Task, internal resource is introduced to expand query, different granularity indexing provides more flexible retrieval space, and pattern disc...
متن کاملIIT TREC 2006: Genomics Track
For the TREC-2006 Genomics Track, we report on the effectiveness of composite information retrieval functions based on a dimensional data model for improving document, passage, and aspect search precision of genomics literature. We designed an approach, and developed a corresponding search engine, based on a novel dimensional data model capable of document, paragraph, sentence, and passage leve...
متن کاملTHUIR at TREC 2004: Genomics Track
This is the first time that THUIR participates in TREC Genomics Track. We took part in both Ad hoc retrieval task and Categorization task. Based on our retrieval system TMiner, our research in the Ad hoc retrieval task focuses on: (1) Category of organism retrieval strategy; (2) Primary Feature Model; (3) Query Expansion (QE) technology; (4) Result fusion method. Five official runs have been su...
متن کاملDUTIR at TREC 2007 Genomics Track
This paper describes our experiments on TREC 2007 Genomics Track which is concerned with question answering extraction from full-text biomedical literatures. In our experiment, named entities were recognized at the preprocessing stage using a two-view method. MeSH was used to expand the terms. We performed passage retrieval by using sentence-level half overlapped sliding windows. Indri structur...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007